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MAMMALIAN EXPRESSION VECTOR 
1. INTRODUCTION 
The present invention relates to the 
construction and utilization of a DNA plasmid vector, 
5 in particular, pCMX. This vector promotes the 
expression of cloned genes when transfected into 
mammalian cell lines. The functional portions of pCMX 
include the immediate early promoter of human 
cytomegalovirus (HCMV) , a "polylinker" sequence 

10 facilitating the insertional cloning of DNA sequences, 
an RNA splice and polyadenylation signal, origins of 
replication from SV40 (for replication in COS cells) 
and pBR322 (for replication in E. coli ) , and the gene 
encoding ^-lactamase, a selectable marker conferring 

15 resistance to ampicillin in bacterial cells. The 

invention can be used to transiently or stably express 
eukaryotic genes within mammalian cells, that can 
provide an environment conducive to appropriate protein 
folding or other eukaryotic protein modifications such 

20 as glycosylation and oligomerization. 

2. BACKGROUND OF THE INVENTION 
The advent of recombinant DNA technology in 
the 1970 's led to the isolation of a myriad of DNA 

25 sequences that encode useful proteins. The in vitro 
expression of cloned eukaryotic genes in mammalian 
cells is well documented. (for review, see (1) 
Maniatis, et al., 1990, In: Molecular Cloning: A 
Laboratory Manual ; Vol. II, Chapter 16; (2) Bebbington 

30 and Hentschel, 1987, In: DNA Cloning Vol III: A 

Practical Approach, Ed., Glover, O.M. , pp. 163-188; (3) 
Sambrook and Gething, 1988, Focus 10 ( 3 ) : 4 1-48 ) . The 
utilization of eukaryotic expression systems has led to 
an increased understanding of eukaryotic promoter 

35 strength, intron function, RNA splicing and 
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polyadenylation functions as well as transport 
phenomena of newly synthesized polypeptides. However, 
a more practical application of in vitro eukaryotic 
expression systems is the cloning of novel cDNAs as 

5 well as the overproduction and isolation of novel gene 
products. Expression of eukaryotic proteins in 
eukaryotic hosts more readily allows production of 
functional proteins correctly folded and modified 
(e.g., via glycosylation and acetylation, for example). 

Two basic types of mammalian expression 
systems have been developed to date. The first 
involves viral expression vectors modified to express a 
gene of interest (for a review, see Muzyczka, 1989, 
Current Topics in Microbiology and Immunology) . This 

15 modification usually encompasses the replacement of a 
portion of the viral genome with the gene of interest. 
The functions lost in cis from the viral genome are 
complemented by cloning this deleted genomic region 
into a separate "helper plasmid". Therefore, co- 

20 transfection of both recombinant vectors results in 
propagation of the virus as well as overproduction of 
the protein of interest. The second type of mammalian 
gene expression system involves the construction of DNA 
plasmid vectors possessing the capacity to express 

25 cloned inserts in mammalian cells. The expression of 
the cloned gene may occur in a transient, 
extrachromosomal manner or through the stable 
transformation of the respective mammalian host cell 
line. The typical mammalian expression vector will 

30 contain (1) regulatory elements, usually in the form of 
viral promoter or enhancer sequences and characterized 
by a broad host and tissue range; (2) a "polylinker" 
sequence, facilitating the insertion of a DNA fragment 
within the plasmid vector; and (3) the sequences 

35 responsible for intron splicing and polyadenylation of 
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mRNA transcripts. This contiguous region of the 
promoter-polylinker-polyadenylation site is commonly 
referred to as the transcription unit. The vector will 
also contain (4) a selectable marker gene(s) (e.g., the 
5 /3-lactamase gene) , often conferring resistance to an 
antibiotic (such as ampicillin) , allowing selection of 
initial positive transf ormants in E . coli ; and (5) 
sequences facilitating the replication of the vector in 
both bacterial and mammalian hosts. 

^ An example of such an expression vector is 

CDM8 (Seed, 1987, Nature 329: 840-842; Seed and Aruffo, 
1987, Proc. Natl. Acad. Sci. USA 84: 3365-3369; Aruffo 
and Seed, Proc. Natl. Acad. Sci. USA 84: 8573-8577), 
the parental plasmid to the pCMX plasmid of the present 

15 invention. The transcription unit of CDM8 is composed 
of a chimeric promoter (the human cytomegalovirus AD169 
constitutive promoter fused to the T7 RNA polymerase 
promoter) , a polylinker region and the SV4 0 small tumor 
(t) antigen splice and early region polyadenylation 

20 signals derived from pSV2 . The human cytomegalovirus 
(HCMV) promoter is expressed in a variety of mammalian 
cell types, while the T7 bacteriophage DNA-dependent 
RNA polymerase promoter can drive in vitro cell-free 
transcription/translation of cloned inserts. This 

25 particular promoter fusion allows initial experiments 
to be conducted within the confines of the host 
mammalian cell type, while further analysis and 
utilization of the cloned insert may potentially be 
carried out in an in vitro "cell-free" 

30 transcription/ translation system. The const itutively 
expressed HCMV promoter has also been utilized in other 
mammalian expression vectors besides CDM8 (for example, 
see Andersson, et al., 1989, J. Biol. Chem. 
264 (14 ): 8222-8229 ) . Origins of replication in CDM8 

35 include (1) nVX (allowing e.g., replication in E. coli ) 
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(2) SV40 origin (allowing e.g., replication in a 
variety of COS cell types) (3) polyoma origin (allowing 
e.g., replication in polyoma virus transformed mouse 
fibroblasts) and (4) the bacteriophage Ml 3 origin 
5 (allowing e.g., generation of single-stranded template 
for DNA sequence analysis and/or oligonucleotide site- 
directed mutagenesis) . Finally, CDM8 carries the supF 
gene for selection in E. coli . In this antibiotic 
selection system, a CDM8 -based plasmid construction is 

1Q transformed into a specialized E. coli strain 
containing an episome carrying genes encoding 
resistance to the antibiotics, ampicillin and 
tetracyline. However, both genes contain chain 
termination ("nonsense" codon) point mutations 

15 inactivating the resistance phenotype. The supF gene 
product, a nonsense suppressor tRNA, restores the 
resistant phenotype for each antibiotic. Therefore, 
selection is based on growth of the specialized 
episomal-carrying E. coli strain on media containing 

20 ampicillin and tetracycline. Colonies exhibiting this 
phenotype are supposedly transformed with the CDM8- 
based plasmid construction. 

The CDM8 vector is compatible with COS cell 
lines as well as cell lines transformed with the 

25 polyoma virus. COS cell lines are African green monkey 
CV1 cells transformed with an origin-defective SV4 0 
mutant virus. The COS cells produce the large T 
antigen, which is required in trans to promote 
replication of SV4 0 or plasmid constructions such as 

30 CDM8 which contain the respective cis-acting sequences 
initiating viral replication. Therefore, COS cells 
transfected with a CDM8-based construction will support 
replication of the plasmid, resulting in increased 
plasmid copy number and a transient overexpression of 

35 the gene of interest. 
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The major use of CDM8 is cDNA expression 
cloning and overproduction of specific proteins in a 
mammalian in vitro expression system. Expression 
cloning takes on various forms depending on the mode of 
5 detection utilized to identify the cDNA of interest 
(see discussion, infra) • However, the initial step 
consists of isolating mRNA and synthesizing double- 
stranded deoxyribonucleic acid copies of the mRNA 
population (cDNAs) . These cDNAs must be efficiently 

1Q ligated to a plasmid or bacteriophage DNA cloning 

vector and transferred to the appropriate host prior to 
library screening and analysis. CDM8 contains two 
BstXI restriction sites, making it amenable to the 
"adaptor" linker procedure of ligating cDNAs to the 

15 vector, i.e., the use of DNA fragments blunt ended at 
one end (and therefore compatible for ligation with the 
blunt ended cDNA) but containing a non-palindromic 
overhang on the other end (in this instance, compatible 
for ligation with BstXI digested vector DNA, but not 

20 with other cDNAs) . 

A cDNA mammalian expression library may be 
utilized in several ways to pursue the identification 
and isolation of novel cDNAs (see Chapter 16 of 
Maniatis, et al. , supra for a review). Briefly, cDNA 

25 libraries are transfected into the appropriate cell 

lines. A secreted gene product may be identified by a 
variety of assay techniques. Seed (1987, Nature 
329:840-842) utilized the CDM8-based cDNA expression 
system (Seed and Aruffo, 1987, Proc. Natl. Acad. Sci. 

30 USA 84:3365-3369) to select cDNAs encoding novel 
surface membrane proteins. These cDNA expression 
proteins integrated on the cell surface and were 
selected by the ability of that cell type to bind to 
specific antibody coated dishes. Positive cell types 

35 
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were collected, the plasmids rescued and subsequently 
transformed into E. coli for further analysis. 

The main drawback of CDM8 involves several 
problems present when utilizing the supF based 
5 antibiotic system for selecting transformed E. coli 
cells. First, the host E. coli strain displays a 
relatively high frequency of antibiotic resistance/ so 
that there is often an unacceptably high background of 
bacterial colonies lacking plasmid sequences. Second, 

1Q low yields of bacterial plasmid DNA derived from these 
host-specific strains are problematic if reversion 
occurs during the growth of a bacterial culture. 
Third, plasmid preparations frequently become 
contaminated with episomal DNA that contains the genes 

15 conferring resistance to ampicillin and tetracycline. 
Fourth, the requirement of specific genes conferring 
antibiotic resistance encompassed within an episome 
drastically reduces the E. coli host range. 

A second common bacterial vector also 

20 utilized as a parental plasmid to the pCMX plasmid of 
the present invention is pGEM4Z (Promega Bulletin 036 , 
1988) , which contains a pUC derived ^-lactamase gene 
and pBR322 origin of replication. pBR or pUC based 
plasmid constructions ( Yanisch-Perron , et al., 1985, 

25 Gene 33 : 103-109) are widely used bacterial vectors. 

3 . SUMMARY OF THE INVENTION 
The invention relates to the construction of 
a DNA plasmid mammalian expression vector, in 
30 particular, pCMX. The DNA plasmid expression vector of 
the present invention retains the principal advantages 
of plasmid CDM8 without its inherent disadvantages. 
The vector of the present invention is far more 
efficient than CDM8 with regard to the quality and 
35 quantity of transiently and stably expressed proteins, 
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and its use for cDNA expression cloning as well as 
overexpression of genes of interest. 

The functional components of the plasmid of 
the invention are (1) the immediate early promoter of 
5 HCMV, (2) an SV4 0 RNA splice/polyadenylation sequence, 
(3) an SV40 origin of replication, (5) a pBR322 origin 
of replication and (4) a selectable marker conferring 
resistance to an antibiotic- In particular, the 
/3-lactamase gene conferring resistance to the 

10 antibiotic ampicillin can be used. 

The invention can be utilized for, but is not 
limited to, the transient expression of a cloned DNA 
sequence in transfected COS cells. While the examples 
presented utilize a COS cell line, the invention is not 

15 limited to one particular mammalian cell line. Any 
mammalian cell type expressing the large T antigen of 
SV40 (see discussion of COS cell types, supra ) may 
potentially support replication and, hence, expression 
of pCMX-based constructions. Additionally, pCMX 

20 derived constructions may be used to produce mammalian 
cell lines that stably express a recombinant protein. 
For example, pCMX-derived constructions can be co- 
transfected with another plasmid(s) containing an 
appropriate dominant selectable marker (s) to isolate 

25 cell lines overproducing the cloned gene product. 

The invention can be utilized to isolate 
novel cDNAs via expression in transfected mammalian 
cells, i.e., by cDNA expression cloning. This 
technique is exemplified in the isolation and 

30 characterization of a cDNA encoding the ciliary 

neurotrophic factor receptor (CNTFR) using pCMX as a 
mammalian expression vector. cDNA expression cloning 
is a technique whereby a population of purified mRNA is 
transcribed into double stranded complementary DNA 

35 sequences (cDNA) ; ligated into a DNA plasmid expression 
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vector (under the control of a constitutive or 
inducible promoter) ; transferred into the appropriate 
prokaryotic or eukaryotic host to form a representative 
"library" of cDNA clones; screened on the basis of a 
5 characteristic of the protein product encoded by the 
desired cDNA expressed by individual cells within the 
population; thus resulting in positive clones being 
purified, characterized and amplified (see Section 2, 
supra , for additional discussion) • One embodiment of 

10 this technique involves expression of cDNA clones in a 
mammalian cell population and selection of novel cDNAs 
on the ability of the cell type expressing that 
particular cDNA to interact with an antibody or ligand, 
capable of specific binding to the encoded product of 

15 that cDNA, which has been previously bound to a solid 
support such as a petri dish. Positive cDNAs (those 
expressed in cell types binding to the solid support) 
are recovered, transformed into a convenient host 
(e.g., E. coli ) and characterized by known recombinant 

20 DNA techniques. This procedure, referred to as 

"panning" (Wysocki and Sata, 1979, Proc. Natl. Acad. 
Sci. USA 75:2844-2848; Seed and Aruffo, 1987, Proc. 
Natl. Acad. Sci. USA 84:3365-3369) may become a more 
powerful technique due to increased expression of the 

25 cloned cDNA insert as provided by the improved vector 
of the present invention, pCMX. In other words, a cell 
line possessing a low affinity to the antibody or 
ligand (through expression of the cDNA) might yield a 
stronger signal due to increased cDNA expression, thus 

30 culminating in the cloning of novel cDNAs unobtainable 
via the use of previously available cDNA expression 
plasmids. 

In another embodiment, the invention is 
utilized to overproduce a proteinaceous product of a 
35 cloned DNA fragment inserted in a polylinker sequence 
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located between the HCMV promoter and the 3 ' 
splice/polyadenylation signal. In a specific 
embodiment, the invention describes the increased 
expression of an antigenically tagged derivative of 
5 nerve growth factor, designated NGF-myc, in pCMX in 
relation to NGF-myc produced in a parental plasmid 
construction . 



3.1 DEFINITIONS 
1Q The terms listed below, as used herein, will 

have the meanings indicated, 

NGF-myc - nerve growth factor-myc fusion construction 
PCR - polymerase chain reaction 
tRNA - transfer ribonucleic acid 
■J5 DMEM - Dulbecco's modified Eagles medium 

CNTFR - ciliary neurotrophic factor receptor 

MTX - methotrexate 

DRG - dorsal root ganglion 

ITS - insulin-transf errin-selinite 

20 

4. DESCRIPTION OF THE FIGURES 
Figure 1. Nucleotide sequence of pCMX 

Figure 2. Construction of pCMX from parental 

vectors CDM8 and pGEM4Z (see infra for 
25 details) 

Figure 3. Construction of CMX : NGF-myc from 

CDM8: NGF-myc and pCMX (see infra for 
details) 



30 



35 
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10 



15 



Figure 4. Autoradiograph of 35 S-Methionine labeled 

proteins from COS cells transfected with: 
Lane 1: No DNA (Mock Transf ection) 
Lane 2 : CDM8 : NGF-myc 
Lane 3 : CDM8 : NGF-myc 
Lane 4: pCMX: NGF-myc 
Lane 5: pCMX : NGF-myc 
(See infra for details) 

5. DETAILED DESCRIPTION OF T HE INVENTION 

The invention relates to the construction of 
a novel DNA plasmid expression vector , in particular, 
pCMX . The functional components of the plasmid of the 
invention are (1) sequences including the immediate 
early promoter of HCMV, (2) an SV40 RNA 

splice/polyadenylation sequence, (3) an SV40 origin of 
replication, (5) a pBR322 origin of replication and (4) 
a selectable marker conferring resistance to an 
antibiotic. In particular, the ^-lactamase gene 
conferring resistance to the antibiotic ampicillin can 
be used. In a specific embodiment, the immediate early 
promoter of HCMV (a promoter active in mammalian cells) 
is fused to the T7 phage RNA polymerase promoter; this 
embodiment thus allows, e.g., for in vitro cell free 
25 expression under control of the T7 promoter. 

The invention can be utilized to isolate 
novel cDNAs via expression in transfected mammalian 
cells, i.e., by cDNA expression cloning. This 
technique is exemplified in the isolation and 
30 characterization of a cDNA encoding the ciliary 

neurotrophic factor receptor (CNTFR) using pCMX as a 
mammalian expression vector. cDNA expression cloning 
is a technique whereby a population of purified mRNA is 
transcribed into double stranded complementary DNA 
35 sequences (cDNA) ; ligated into a DNA plasmid expression 



20 
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vector (under the control of a constitutive or 
inducible promoter) ; transferred into the appropriate 
prokaryotic or eukaryotic host to form a representative 
"library" of cDNA clones; screened and selected on the 

5 basis of immunological, enzymatic, phenotypic, 

functional, or other characteristics of the protein 
product encoded by the desired cDNA ; resulting in 
positive clones being purified, characterized and 
amplified (see Section 2, supra , for additional 

J0 discussion) . One embodiment of this technique involves 
expression of cDNA clones in a mammalian cell 
population and selection of novel cDNAs on the ability 
of the cell type expressing that particular cDNA to 
interact with an antibody or ligand, capable of 

15 specific binding to the encoded product of that cDNA, 
which has been previously bound to a solid support such 
as a petri dish. Positive cDNAs (those expressed in 
cell types binding to the solid support) are recovered, 
transformed into a more amenable host (e.g., E. coli) 

2Q and characterized by known recombinant DNA techniques. 
This procedure, referred to as "panning" (Wysocki and 
Sata, 1979, Proc. Natl. Acad. Sci. USA 75:2844-2848; 
Seed and Aruffo, 1987, Proc. Natl. Acad. Sci. USA 
84:3365-3369) may become a more powerful technique due 

25 to increased expression of the cloned cDNA insert as 
provided by the improved vector, pCMX. In other words, 
a cell line possessing a low affinity to the antibody 
or ligand (through expression of the cDNA) might yield 
a stronger signal due to increased cDNA expression, 

30 thus culminating in the cloning of novel cDNAs 

unobtainable via the use of previously available cDNA 
expression plasmids . 

Specific embodiments of the invention 
include, but are not limited to, the use of pCMX in 

35 expression cloning in which the cDNA is selected based 
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not on antigenic properties of the expressed protein, 
but on other characteristics such as biological 
activity/ For example, if the expressed product is 
secreted, the selection can be based on the biological 
5 activity of the pooled cell supernatants . Altered cell 
phenotype in transformed cell lines, (see discussion, 
suora ) , etc. can also be the basis for selection. 

In another embodiment, the invention can be 
utilized to overproduce a proteinaceous product of a 

1Q cloned DNA fragment inserted in a polylinker sequence 
located between the HCMV promoter and the 3 ' 
splice/polyadenylation signal. In a specific 
embodiment, the invention describes the increased 
expression of NGF-myc in pCMX in relation to NGF-myc 

15 produced in a parental plasmid construction. 

The invention can be utilized for, but is not 
limited to, the transient expression of a cloned DNA 
sequence in transfected COS cells. While the examples 
presented utilize a COS cell line, the invention is not 

20 limited to one particular mammalian cell line. Any 
mammalian cell type expressing the large T antigen of 
SV40 (see discussion of COS cell types, supra) may 
potentially support replication and, hence, expression 
of pCMX-based constructions. 

25 Additionally , stable mammalian cell lines may 

be chosen in which pCMX derived constructions are co- 
transfected with another plasmid (s) containing an 
appropriate dominant selectable marker (s) utilized to 
isolate cell lines overproducing the cloned gene 

30 product. By way of example but not limitation, CHO 
cells deficient in both alleles of the dihydrof olate 
reductase (dhfr) gene (Urlaub and Chasin, 1980, Proc. 
Natl. Acad. Sci. USA 77:4216-4220) may be co- 
transf ected with a pCMX-based construction and a 

35 plasmid carrying the dhfr gene under control of a weak 
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promoter. The resulting co-transf ormants will result 
in individual colonies resistant to the selective 
pressure of nucleoside-f ree media. Such colonies are 
then treated with methotrexate (MTX) to induce gene 
5 amplification (Alt, et al., 1978, J. Biol. Chem. 

253:1357-1370). MTX-resistant individual colonies may 
then be isolated and screened for an amplified CHO 
transformant overexpressing the cloned insert of 
interest. In another embodiment, 3T3 cells may be 

10 transfected for stable expression. 

In the present invention, the plasmid pCMX 
supports expression of a cloned DNA sequence. The 
invention is illustrated by way of working examples in 
which the expression of NGF-myc in both pCMX and CDM8 

15 demonstrates the increased utility of pCMX to express 
eukaryotic genes in mammalian cells. The invention is 
not limited to the expression of NGF-myc, but to any 
genomic, cDNA or other DNA sequence inserted 3' to the 
aforementioned promoter element in such a fashion as to 

20 result in the synthesis of a stable RNA transcript. In 
a further embodiment of the invention, the presence of 
a functional T7 phage RNA polymerase promoter permits 
use of the pCMX vector outside the confines of a host 
mammalian cell. This promoter drives in vitro cell- 

25 free transcription/translation of DNA inserts cloned 
downstream from the HCMV/T7 promoter fusion. 

The transcription unit of pCMX is unchanged 
from CDM8. The HCMV promoter constitutively expresses 
downstream DNA sequences. A polylinker region 

30 containing multiple cloning sites facilitates insertion 
cloning of the DNA sequence of interest proximal to the 
HCMV promoter. Further downstream is an SV4 0 
splice/polyadenylation signal. Additionally, both pCMX 
and CDM8 contain an SV4 0 origin of replication and 

35 pBR322 origin of replication. The former origin 
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promotes replication in mammalian cells (e.g., COS 
cells) expressing the SV40 T antigen, while the latter 
origin promotes replication in E. coli cells. 

Four substantial differences exist between 
5 pCMX and its parent plasmid, CDM8 • First, the gene 
encoding jS-lactamase (conferring resistance to 
ampicillin) replaces the tRNA amber suppressor gene as 
the selectable marker in E. coli . The /J-lactamase gene 
was derived from pGEM4Z (a plasmid commercially 

10 available from Promega Corp.). The portion of pGEM4Z 
utilized in the construction of pCMX is also found in a 
previous series of plasmids, such as pUC (Yanisch- 
Perron, et al., 1985, Gene 33 : 103-109 ) based 
derivatives. Second, as depicted in Figure 2, the 7rVX 

15 plasmid origin of replication in CDM8, used to permit 
episomal replication in E. coli , has been replaced with 
another plasmid origin of replication (originally 
derived from plasmid pBR322) . This origin was 
introduced together with the ^-lactamase gene from 

20 pGEM4Z using PCR. Third, the origin of replication 
from the single-stranded bacteriophage, M13, has been 
removed. The Ml 3 origin of replication contains 
cis-acting sequences promoting single-stranded DNA 
production from double stranded plasmid template. 

25 Single stranded DNA has been utilized for years as the 
template of choice for the dideoxy method of DNA 
sequencing. However, recent advances in recombinant 
DNA techniques allow for direct and efficient 
sequencing from a double stranded plasmid template. 

30 Third, the polyoma origin of replication has been 
removed. This origin of replication allows for 
propagation of plasmids in mammalian cell lines 
infected with the polyoma virus. The majority of 
applications involving cDNA expression cloning, as well 

35 as the transient and stable expression of genes of 
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interest, in mammalian cell lines are adequately met 
utilizing COS cell types. Therefore, the polyoma 
origin was deemed unnecessary and was removed during 
the construction of pCMX. Replacement of the tRNA 

5 amber suppressor gene (2 00 bp) with the ^-lactamase 
gene and pBR origin of replication (1050 bp) and 
removal of the M13 origin (600 bp) and polyoma origin 
(800 bp) result in a pCMX vector that is 550 bp smaller 
(12% reduction) than CDM8 (Figure 2). 

g The vector, pCMX, allows for cDNA expression 

cloning, as well as the overexpression of genes of 
interest in mammalian cell lines. 



15 6. EXAMPLES 

6.1. MATERIALS AND METHODS 
6.1.1. CONSTRUCTION OF PCMX 
The eukaryotic expression plasmid, pCMX 
(Figure 1) , was constructed as described schematically 

20 in Figure 2. Briefly, an Sstl and a BamHI restriction 
site were generated flanking the SV40 origin of 
replication via the polymerase chain reaction (PCR) 
(Figure 2; primer C and D) . The PCR product was 
isolated and digested with Sstl/BamHI to create a 

25 fragment flanked by these two sites. This fragment 
contains the SV4 0 origin of replication from CDM8 but 
lacks the polyoma origin, M13 origin and tRNA 
suppressor gene (supF) contained within the BamHI/MluI 
fragment of CDM8 . Primers A and B were used with 

30 pGEM4Z to generate a PCR product that contains the /3- 
lactamase gene and the pBR322 origin of replication. 
Primer A introduces an Mlul site while primer B 
introduces an Sstl site. This MluI/SstI fragment was 
introduced into a triple ligation with the Sstl/BamHI 

35 fragment containing the SV4 0 origin of replication and 
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CDM8 digested with BamHI/MluI (which comprises the 
transcriptional unit) ; resulting in pCMX (Figure 2). 

6.1.2 CONSTRUCTTON OF CDMH : NGF-M VC AND pCMXlNGF-MYC 
■ The patch PCR technique was used to generate 

a PCR product containing the mouse NGF gene (kindly 
provided by Drs. E. Shooter and U. Suter, Stanford 
University) linked, through a bridge encoding two 
glycines, to a sequence encoding the 10 amino acid myc 
TO epitope (Squinto, et al., 1990, Neuron 5:757-766). The 
primers were designed to result in a PCR product in 
which the last two codons of the native NGF gene were 
deleted due to possibility that the amino acids encoded 
by these codons would represent a proteolytic cleavage 
site that would result in the loss of the myc epitope. 
The PCR product was then digested with Xhol and 
subcloned into the CDM8 expression plasmid to generate 
the expression plasmid CDM8 : NGF— myc (Figure 3) as 
deposited with the ATCC under the accession number 
40825 and strain designation pC8-LmN-3 ' MYC. The NGF- 
myc insert was excised from CDM8 with the enzymes Xhol 
and NotI and subcloned into XhoI/Notl-digested pCMX, 
yielding the plasmid pCMX:NGF-myc (Figure 3). 



15 



20 



25 



6 13. TRANSFECTION OF COSM5 CELLS WITH 
pCMX : NGF— MYC AND CDM8 : NGF— MYC 

COSM5 cells were transfected using a DEAE- 

dextran protocol (Maniatis, et al, supra). Twenty four 

hours prior to transf ection, exponentially growing 

COSM5 cells were harvested by trypsinization and 

30 replated at a density of 1.5 x 10 5 cells/35 mm dish in 

complete growth media. The cultures were incubated for 

approximately 24 hours at 37C° in a 5% C0 2 incubator. 

For a 35 mm dish, the medium was aspirated off and 1ml 

oc of transf ection media was immediately added to the 
35 
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culture dish (transf ection media is: RPMI (serum free)/ 
400 Hg/ml DEAE/ 100 /xM chloroquine / 1 x glutamine/ 
ix ITS [insulin-transferrin-selinite] ) . Ten nanograms 
to 10 micrograms of DNA to be transf ected was added, 
the culture dish swirled several times and placed at 
37 °C in a 5% C0 2 incubator for 3-3.5 hr. Cells 
routinely appeared slightly vacuolated subsequent to 
the incubation. The transf ection medium was replaced 
with an equal volume of PBS/ 10% DMSO for 2-3 minutes. 
This medium was then replaced with growth medium (10% 
FBS in Dulbecco's modified Eagles medium [ DMEM ] ) and 
incubated further at 37°C at 5% C0 2 for 2-3 days. 



6.1.4. 35 S-METHIONINE LABELLING OF 
POLYPEPTIDES IN TRANSFECTED 
15 C0SM5 CELLS 

Approximately 5 x 10 5 COSM5 cells/60mm dish 

were plated in 2 . 5 ml of growth medium containing 10% 

FBS following trypsin-EDTA treatment. The cells were 

then transf ected with 10 fig of either CDM8 : NGF-myc DNA, 

20 pCMX : NGF-ray c DNA or CDM8 DNA utilizing the DEAE-dextran 
protocol as described, supra . The transfected cells 
were incubated for 48-72 hours at 37°C in a 5% C0 2 
incubator. The supernatant was removed, cells were 
washed once with 1 x PBS (lacking Ca/Mg) , and once with 

25 5 ml DMEM without methionine. One milliliter of DMEM 
(without methionine) containing 1 x ITS was added to 
the cells and incubation was continued at 37 °C in a 5% 
C0 2 incubator for 1 hour. To the medium was added 100 
/xCi of 35 S-methionine and incubation was continued for 4 

30 hours under identical conditions. Debris was removed 
by microcentrifugation, and the supernatant was 
collected and transferred to a fresh, sterile eppendorf 
tube and either prepared for electrophoresis or stored 
at -70°C for a period of up to two weeks. Samples were 

35 
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15 



20 



25 



30 



prepared for electrophoresis -through a 15% 
polyacrylamide-SDS gel by standard techniques (see 
Maniatis, et.al; supra) . 

6.1.5. BIOASSAY OF pCMX:NGF-MYC AND 

CDM8 : NGF-MYC COS CELL EXTRACTS 
TN DORSAL ROOT GANGLION EXPLANTS 

COSM5 cells were transfected with either 
pCMX:NGF-myc DNA, CDM8 : NGF-myc DNA or mock-transf ected 
(no DNA) were collected by centrif ugation . Cell 
supernatants were prepared. Biological activity of the 
supernatants were assayed on explants of chick embryo 
dorsal root ganglia (DRG) as described in Lindsay and 
Rohrer (1985, Devel. Biol. 112:30). Briefly, DRG were 
dissected from chick embryos of 10 days incubation 
(E10) and 5-6 ganglia were explanted in 1 ml of a 
collagen gel matrix in 35 mm tissue culture dishes. 
After the gel had set, 1 ml of tissue culture growth 
medium F14 (Imperial Labs. U.K.) supplemented with 5% 
heat-inactivated horse serum (GIBCO) was added before 
addition of varying amounts of transfected COS cell 
supernatants. In explant cultures, the activity of 
respective COS cell extracts was determined by 
assessing the extent of fiber outgrowth in treated 
cultures compared to controls. Fiber outgrowth was 
scored on a scale of 0 to 5+, by comparing cultures to 
photographs of a dose-response of explanted DRG and 
cultured with increasing amounts of NGF. In all cases 
results were derived from triplicate cultures. 

6.1.6. CONSTRUCTION OF A CNTF-RECEPTOR 
EXPRESSION LIBRARY 

SH-SY5.Y cells (originally obtained from Dr. 

June Biedler) were used as a source of mRNA for 

construction of a cDNA library using the pCMX 
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expression vector. Inserts for the cDNA library (see 
Section 2 for discussion) were selected on an agarose 
gel for sizes larger than 1 kb. The expression library 
was utilized to evaluate the efficiency of pCMX in 
5 expression cloning. As a working example, but not a 
limitation, cDNA clones expressing the ciliary 
neurotrophic factor receptor gene were isolated ( infra , 
Section 6.1.8) . 

1Q 6.1.7. "PANNING" METHOD 

The "panning" method developed by Seed and 
Aruffo (1987, Proc. Natl. Acad. Sci. U.S.A. 84 ; 3365- 
3369) was modified as follows: Instead of incubating 
the cells with antibodies recognizing the receptor, 

15 cells were incubated first with a modified form of rat 
CNTF engineered to contain a polypeptide epitope of the 
human C-myc protein (designated CNTF-myc) . The ligand 
was used at 1 /xg/ml and incubation was carried out on 
ice for 30 minutes; cells were then spun through PBS/2% 

20 Ficoll to remove excess ligand, and then incubated with 
the anti-myc monoclonal antibody 9E10 (obtained from 
Oncogene Sciences, Manhasset, N.Y.) for 30 minutes on 
ice. This was followed by another spin through PBS/2% 
Ficoll and "panning" on plates coated with goat anti- 

25 mouse IgG antibody obtained from Sigma. The plates 

were prepared as follows: Bacteriological 60 mm plates 
(Falcon 1007 or the equivalent) , or 10 cm dishes such 
as Fisher 8-757-12 were coated with goat anti-mouse 
antibody, diluted to 10 /zg/ml per ml in 50 mM Tris-HCl, 

30 pH 9.5. Three ml of antibody was used to coat each 
6 cm dish, or 10 ml was used per 10 cm dish; plates 
were exposed to antibody for about 1.5 hours, then 
antibody was removed to the next dish, allowed to stand 
for 1.5 hours, and then removed again to a third dish. 

35 Plates were washed three times with 0.15 M NaCl (a wash 
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bottle is convenient for this), and incubated with 3 ml 
of 1 mg/ml BSA in PBS overnight. In particular, 
"panning" was performed as follows: Cells were 
cultured in 60 mm dishes. Medium was aspirated from 
5 each dish, and 2 ml PBS/0.5 mM EDTA/0.02% azide was 
added and the mixture was incubated at 37° for 30 min. 
to detach cells from the dish. The cells were 
triturated vigorously with a short pasteur pipet, 
collected from each dish in a centrifuge tube, and spun 

1Q 4 min. at 200 x g. Cells were resuspended in 0.5-1.0 
ml PBS/EDTA/azide/5% FBS and incubated with CNTFrmyc 
for 30 min. on ice. An equal volume of PBS / EDTA /azide 
was layered carefully on 3 ml PBS /EDTA/ azide/ 2% Ficoll, 
spun 4 min. at 200 x g, and the supernatant was 

15 aspirated in one smooth movement. The cells were then 
incubated with 9E10 antibody for 30 minutes on ice, and 
the spin through PBS/EDTA/azide/2% Ficoll was repeated. 
The cells were taken up in 0.5 ml PBS/EDTA/azide/5% FBS 
by pipetting through 100 micron Nylon mesh (Tetko) . 

2 Q Cells were added from at most two 60 mm dishes to one 
60 mm goat anti-mouse Ig antibody coated plate r and 
allowed to sit at room temperature 1-3 hours. Excess 
cells not adhering to dish were removed by gentle 
washing with PBS/ 5% serum or with medium (2 or 3 washes 

25 of 3 ml were usually sufficient). 

6.1.8. IDENTIFICATION OF CLONES CONTAINING THE 

CILIARY NEUROTROPHIC FACTOR RECEPTOR GENE 

Plasmid DNA from the expression library was 

transf ected into COSM5 cells (approximately 250-500 ng 

per 100 mm dish; 2 dishes were transf ected) , using 

DEAE/chloroquine according to standard procedures. Two 

days after transf ection, cells were detached from their 

dishes and subjected to the Aruf f o/Seed panning 

procedure modified as described supra . 
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After washing nonadhering cells from the 
plates, Hirt supernatants were prepared, and plasmid 
DNA was precipitated in the presence of 10-2 0 m<? °f 
tRNA. The resulting DNA was introduced into DH10B 
5 bacteria by electroporation (Electromax, BRL) according 
to the manufacturer's instructions. Cultures grown 
from the electroporated bacteria were used to prepare 
plasmid DNA for another round of transfection and 
panning; a plate of COS cells transfected with this 

1Q plasmid DNA clearly revealed a large number of COS 
cells expressing the CNTFR by an indirect iodinated- 
antibody binding assay (carried out according to 
Section 6.1.9). After a second round of 
panning/plasmid DNA isolation/electroporation on these 

15 transf ectants, the bacterial transf ormants resulting 
from the electroporation step were plated out on 
ampicillin plates. Fifteen individual bacterial 
colonies were picked, and plasmid DNA prepared from 
each of the clones were transfected individually into 

20 COS cells for assay. 

6.1.9. INDIRECT 125 I GOAT ANTI-MOUSE 

ANTIBODY BINDING ASSAY 

COS cells were transfected with plasmid DNA from 

the library, the enriched library, or individual 

25 

clones. After 48 hours, cells were incubated 
sequentially for 30 minutes on ice with PBS (with Ca, 
Mg)/5% FBS containing: 

1) 1 Mg/ml CNTF-myc; 

2) 10 /ug/ral 9E10; 

30 

3) n5 I goat anti-mouse antibody (GaM) (0.5-1 
jzCi/ml) . 

Cells were washed 3x5 minutes in PBS/5% FBS after 
each step. After the last wash, the plates were 
qK autoradiographed . 
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For the individual clones, a quantitative estimate 
of total radioactivity bound was made with a hand-held 
gamma counter. 

6.1.10. FLUORESCENCE ACTIVATED CELL-SORTING ANALYSIS 

Transfected COS cells were incubated sequentially 
with CNTF-myc, 9E10 antibody/ and FITC-labelled goat 
anti-mouse antibody. Then they were detached from 
dishes and subjected to FACS analysis. COS cells 
transfected with a CNTF-receptor expressing plasmid 
contain a large subpopulation displaying greatly 
increased fluorescence by this assay. 



10 



15 



20 



25 



30 



6.2. RESULTS 

6.2.1 COMPARATIVE EXPRESSION OF 

DCMX : NGF-MYC AND CDM8 : NGF-MYC 

35 S-Methionine labelled COSM5 cells were 
collected as described, supra . Equal amounts of COSM5 
cell supernatants from mock transfected cells (Figure 
4, lane 1), cells transfected with CDM8 :NGF-myc DNA 
(Figure 4, lanes 2 4-3) or pCMX:NGF-myc DNA (lanes 4 + 
5) were electrophoresed through a 15% polyacrylamide- 
SDS gel and subjected to autoradiography. The auto- 
radiograph was analyzed by scanning densitometry. The 
level of expression of NGF-myc in COSM5 cells trans- 
fected with pCMX: NGF-myc was approximately 8-10 fold 
higher than in COSM5 cells transfected with CDM8: NGF- 
myc (compare Lanes 4+5 with lanes 2 + 3 [see arrow]). 
Also, note the complete lack of NGF-myc expression in 
C0SM5 cells transfected with pCMX minus the NGF-myc 
insert. 

Bioactivity of C0SM5 cell supernatants on DRG 
explants is presented in Table I. 
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Table I. 

Bioacfcivity of COS cell supernatants on DRG explants 

Neurite outgrowth 



Positive Control* 5+,5,4,4,4 

Mock Transfected 

50 Ml 0,0,0,1/2,1/2 

10 

CDM8 : NGF— myc : 

50 Hi 5+, 5+, 5+, 5+, 1 

20 fXl 5,5,5,5,5 

10 All 2,3,3,4,4 

j5 5 Hi 2,3,2,2,3 

1 Ml 1,1,1,2,2 

CMX : NGF-myc : 

50 /ul 5+, 5+, 5+, 5+, 5+ 

2 0 20 Ml 5+,5+,5+,5+,5+ 

10 Ml 5,5,5,5,5+ 

5 Ml 3,4,4,4,4 

1 Ml 1,1,1,2,2 

25 *NGF at 5 ng/ml 
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Increasing amounts of extract from pCMX:NGF- 
myc and CDM8:NGF-myc were tested. The most notable 
effect was observed when either 5 jil or 10 /xl of either 
5 supernatant is tested. In such a case, approximately a 
two to four-fold increase in NGF-myc activity was 
observed in supernatant s from CMX: NGF-myc transfected 
COSM5 cells, as compared with the pCDM8 : NGF-myc 
transfected COSM5 cell supernatants . 

Therefore, expression of stable protein as 
well as the biological activity of said protein was 
substantially greater in pCMX: NGF-myc transfected cells 
than with CDM8 : NGF-myc transfected cells, 

15 6.2.2 ISOLATION OF cDNA ENCODING CNTFR 

Out of 15 plasmids tested as described supra, 
14 resulted in transfected COS cells expressing CNTF 
binding sites as determined by a variety of assays/ 
including the indirect antibody binding assay and 

20 fluorescence activated cell sorting (FACS) analysis 
described infra. 

Restriction analysis of the 14 positive 
clones showed that they fell into four classes. 
Members of each class produced an identical pattern of 

25 bands on digestion with the enzyme Pstl. Further 

restriction analysis revealed that the four classes of 
clones overlapped, and preliminary sequence data 
confirmed that they shared overlapping sequences at 
their 5 1 ends. 

30 To characterize the proteins coded for by the four 

classes of clones, they were all transcribed from the 
T7 promoter in the 5' region of the vector polylinker. 
After in vitro translation, the products were 
electrophoresed on a polyacrylamide gel. Class (a) 

35 produced no protein, since it is in the wrong 
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orientation with respect to the T7 promoter. The other 
three classes all produced proteins of identical sizes 
(approximately 42 kd) , verifying that they encoded the 
same protein. This demonstrates that pCMX performed 
5 reliably in the isolation of 4 independent clones of 
the CNTFR. Additional results confirmed the existence 
of cDNAs encoding the CNTFR, as shown by indirect 
antibody binding assay and fluoresence activated cell 
sorting " (FACS) " analysis (see copending application 
10 entitled "Ciliary Neurotrophic Factor Receptor" filed 
on even date herewith, and incorporated by reference 
herein) . 

7 . DEPOSIT OF MICROORGANISMS 
The following microorganisms were deposited 
with the Agricultural Research Culture Collection 
(NRRL) , Northern Regional Research Center, 1815 North 
University St., Peoria, IL, 61604, on March 26, 1991 
and given accession numbers as follows: 

Strain Plasmid Accession No. 

E. coli DHSaFl carrying pCMX B-18790 

The plasmid DNA expression vector CDM8 : NGF-myc was 
25 previously deposited with the ATCC with the accession 
numbers 40825 (strain designation pC8-LmN-3 ' MYC) and 
40864 (strain designation pC8 /mN/MYC-NMl ) . 

The present invention is not to be limited in 
scope by the microorganisms deposited or the 
30 embodiments disclosed in the examples which are 
intended as illustrations of a few aspects of the 
invention, and any embodiments which are functionally 
equivalent are within the scope of this invention. 
Indeed, various modifications of the invention in 
35 addition to those shown and described herein will 
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become apparent to those skilled in the art and are 
intended to fall within the scope of the appended 
claims. 

Various references are cited herein, the 
disclosures of which are incorporated by reference in 
their entireties. 
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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Davis, Sam 

Yancopoulos, George D . 

(ii) TITLE OF INVENTION : Mammalian Expression Vector 

(iii) NUMBER OF SEQUENCES: 1 



(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Pennie & Edmonds 

(B) STREET: 1155 Avenue of the Americas 

(C) CITY: New York 

(D) STATE: N.Y. 
(F) ZIP: 10036 



(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS /MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1-25 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 



(viii) ATTORNEY /AGENT INFORMATION: 

(A) NAME: Misrock, S.Leslie 

(B) REGISTRATION NUMBER: 18 , 872 

(C) REFERENCE/DOCKET NUMBER: 6526-052-999 



(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 212 790-9090 

(B) TELEFAX: 212 869-9741 



(2) INFORMATION FOR SEQ ID NO:l: 



<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3944 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 



(ii) MOLECULE TYPE: cDNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:l: 



ACGCGTTGAC 


ATTGATTATT 


GACTAGTTAT 


TAATAGTAAT 


CAATTACGGG 


GTCATTAGTT 


60 


CAT AG C CCAT 


ATATGGAGTT 


CCGCGTTACA 


TAACTTACGG 


TAAATGGCCC 


CGCCTGGCTG 


120 


ACCGCCCAAC 


GACCCCCGCC 


CATTGACGTC 


AATAATGACG 


TATGTTCCCA 


TAGTAACGCC 


180 


AATAGGGACT 


TTCCATTGAC 


GTCAATGGGT 


GGACTATTTA 


CGGTAAACTG 


CCCACTTGGC 


240 


AG T A C ATCAA 


GTGTATCATA 


TGCCAAGTAC 


GCCCCCTATT 


GACGTCAATG 


ACGGTAAATG 


300 


GCCCGCCTGG 


CATTATG CCC 


AGTACATGAC 


CTTATGGGAC 


TTTCCTACTT 


GGCAGTACAT 


360 


CTACGTATTA 


GTCATCGCTA 


TTACCATGGT 


GATGCGGTTT 


TGGCAGTACA 


TCAATGGGCG 


420 


TGGATAGCGG 


TTTGACTCAC 


GGGGATTTCC 


AAGTCTCCAC 


CCCATTGACG 


TCAATGGGAG 


480 



WO 92/1 758 1 PCI7US92/02492 

- 28 - 



TTTGTTTTGG 


CACCAAAATC 


AACGGGACTT 


TCCAAAATGT 


CGTAACAACT 


CCGCCCCATT 


540 


GACGCAAATG 


GGCGGTAGGC 


GTGTACGGTG 


GGAGGTCTAT 


ATAAGCAGAG 


CTCTCTGGCT 


600 


AACTAGAGAA 


CCCACTGCTT 


ACTGGCTTAT 


CGAAATTAAT 


ACGACTCACT 


ATAGGGAGAC 


660 


CGGAAGCTTC 


TAGAGATCCC 


TCGACCTCGA 


GATCCATTGT 


GCTGGCGCGG 


ATTCTTTATC 


720 


ACTGATAAGT 


TGGTGGACAT 


ATTATGTTTA 


TCAGTGATAA 


AGTGTCAAGC 


ATGACAAAGT 


780 


TGCAGCCGAA 


TACAGTGATC 


CGTGCCGCCC 


TAGACCTGTT 


GAACGAGGTC 


GGCGTAGACG 


840 


GTCTGACGAC 


ACGCAAACTG 


GCGGAACGGT 


TGGGGGTTCA 


GCAGCCGGCG 


CTTTACTGGC 


900 


ACTTCAGGAA 


CAAGCGGGCG 


CTGCTCGACG 


CACTGGCCGA 


AGCCATGCTG 


GCGGAGAATC 


960 


ATAGCACTTC 


GGTGCCGAGA 


GCCGACGACG 


ACTGGCGCTC 


ATTTCTGACT 


GGGAATGCCC 


1020 


GCAGCTTCAG 


GCAGGCGCTG 


CTCGCCTACC 


GCCAGCACAA 


TGGATCTCGA 


GGGATCTTCC 


1080 


ATACCTACCA 


GTTCTGCGCC 


TGCAGGTCGC 


GGCCGCGACT 


CTAGAGGATC 


TTTGTGAAGG 


1140 


AACCTTACTT 


CTGTGGTGTG 


ACATAATTGG 


ACAAACTACC 


TACAGAGATT 


TAAAGCTCTA 


1200 


AGGTAAATAT 


AAAATTTTTA 


AG TG TATAAT 


GTGTTAAACT 


ACTGATTCTA 


ATTGTTTGTG 


1260 


TATTTTAG AT 


TCCAACCTAT 


GGAACTGATG 


AATGGGAGCA 


GTGGTGGAAT 


GCCTTTAATG 


1320 


AGGAAAACCT 


GTTTTGCTCA 


GAAGAAATGC 


CATCTAGTGA 


TGATGAGGCT 


ACTG CTG ACT 


1380 


CTCAACATTC 


TACTCCTCCA 


AAAAAGAAGA 


GAAAGGTAGA 


AGACCCCAAG 


GACTTTCCTT 


1440 


CAGAATTGCT 


AAGTTTTTTG 


AGTCATGCTG 


TGTTTAGTAA 


TAGAACTCTT 


GCTTGCTTTG 


1500 


CTATTTACAC 


CACAAAGGAA 


AAAG CTGCAC 


TGCTATACAA 


GAAAATTATG 


GAAAAATATT 


1560 


CTGTAACCTT 


TATAAGTAGG 


CATAACAGTT 


ATAATCATAA 


CATACTGTTT 


TTTCTTACTC 


1620 


CACACAGGCA 


TAGAGTGTCT 


GCTATTAATA 


ACTATGCTCA 


AAAATTGTGT 


ACCTTTAGCT 


1680 


TTTTAATTTG 


TAAAGGGGTT 


AATAAGGAAT 


ATTTGATGTA 


TAGTGCCTTG 


ACTAGAGATC 


1740 


ATAATCAGCC 


ATACCACATT 


TGTAGAGGTT 


TTACTTGCTT 


TAAAAAACCT 


CCCACACCTC 


1800 


CCCCTGAACC 


TGAAACATAA 


AATGAATGCA 


ATTGTTGTTG 


TTAACTTGTT 


TATTGCAGCT 


1860 


TATAATGGTT 


ACAAATAAAG 


CAATAGCATC 


ACAAATTTCA 


CAAATAAAGC 


ATTTTTTTCA 


1920 


CTGCATTCTA 


GTTGTGGTTT 


GTCCAAACTC 


ATCAATGTAT 


CTTATCATGT 


CTGGATCCTT 


1980 


ACTGCGCCCA 


TCCCGCCCCT 


AACTCCGCCC 


AGTTCCGCCC 


ATTCTCCGCC 


CCATGG CTG A 


2040 


CTAATTTTTT 


TTATTTATGC 


AGAGGCCGAG 


GCCGCCTCGG 


CCTCTGAGCT 


ATTCCAGAAG 


2100 


TAGTGAGGAG 


GCTTTTTTGG 


AGGCCTAGGC 


TTTTGCAAAA 


AGGAGCTCCC 


AGCAAAAGGC 


2160 


CAGGAACCGT 


AAAAAGGCCG 


CGTTGCTGGC 


GTTTTTCCAT 


AGGCTCCGCC 


CCCCTGACGA 


2220 


G CATC AC AAA 


AATCGACGCT 


CAAGTCAGAG 


GTGGCGAAAC 


CCGACAGGAC 


TATAAAGATA 


2280 


CCAGGCGTTT 


CCCCCTGGAA 


GCTCCCTCGT 


GCGCTCTCCT 


GTTCCGACCC 


TGCCGCTTAC 


2340 


CGGATACCTG 


TCCGCCTTTC 


TCCCTTCGGG 


AAGCGTGGCG 


CTTTCTCAAT 


GCTCACGCTG 


2400 


TAGGTATCTC 


AGTTCGGTGT 


AGGTCGTTCG 


CTCCAAGCTG 


GGCTGTGTGC 


ACGAACCCCC 


2460 


CGTTCAGCCC 


GACCGCTGCG 


CCTTATCCGG 


TAACTATCGT 


CTTGAGTCCA 


ACCCGGTAAG 


2520 


ACACGACTTA 


TCGCCACTGG 


CAGCAGCCAC 


TGGTAACAGG 


ATTAGCAGAG 


CGAGGTATGT 


2580 
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AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG 

ATTTGGTATC TGCGCTCTGC TGAAGCCAGT 

ATCCGGCAAA CAAACCACCG CTGGTAGCGG 

GCGCAGAAAA AAAGGATCTC AAGAAGATCC 

GTGGAACGAA AACTCACGTT AAGGGATTTT 

CTAGATCCTT TTAAATTAAA AATGAAGTTT 

TTGGTCTGAC AGTTACCAAT GCTTAATCAG 

TCGTTCATCC ATAGTTGCCT GACTCCCCGT 

ACCATCTGGC CCCAGTGCTG CAATGATACC 

ATCAGCAATA AACCAGCCAG CCGGAAGGGC 

CGCCTCCATC CAGTCTATTA ATTGTTGCCG 

TAGTTTGCGC AACGTTGTTG CCATTGCTAC 

TATGGCTTCA TTC AG CTCCG GTTCCCAACG 

GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC 

AGTGTTATCA CTCATGGTTA TGGCAGCACT 

AAGATGCTTT TCTGTGACTG GTGAGTACTC 

GCGACCGAGT TGCTCTTGCC CGGCGTCAAT 

TTT AAAAG TG CTCATCATTG GAAAACGTTC 

GCTGTTGAGA TCCAGTTCGA TGTAACCCAC 

TACTTTCACC AGCGTTTCTG GGTGAGCAAA 

AATAAGGGCG ACACGGAAAT GTTGAATACT 

CATTTATCAG GGTTATTGTC TCATGAGCGG 

ACAAATAGGG GTTCCGCGCA CATTTCCCCG 
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GCCTAACTAC GGCTACACTA GAAGGACAGT 2 640 

TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 2700 

TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 27 60 

TTTGATCTTT TCTACGGGGT CTGACGCTCA 2820 

GGTCATGAGA TTATCAAAAA GGATCTTCAC 2880 

TAAATCAATC T AAAG TAT AT ATGAGTAAAC 2940 

TGAGGCACCT ATCTCAGCGA TCTGTCTATT 3000 

CGTGTAGATA ACTACGATAC GGGAGGGCTT 3060 

GCGAGACCCA CGCTCACCGG CTCCAGATTT 3120 

CG AG CGCAG A AGTGGTCCTG CAACTTTATC 3180 

GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 324 0 

AGGCATCGTG GTGTCACGCT CGTCGTTTGG 3300 

ATCAAGGCGA GTTACATGAT CCCCCATGTT 3360 

TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 3420 

GCATAATTCT CTTACTGTCA TGCCATCCGT 3480 

AACCAAGTCA TTCTGAGAAT AGTGTATGCG 3540 

ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 3600 

TTCGGGGCGA AAACTCTCAA GGATCTTACC 3660 

TCGTGCACCC AACTGATCTT CAGCATCTTT 3720 

AACAGGAAGG CAAAATG CCG CAAAAAAGGG 3780 

CATACTCTTC CTTTTTCAAT ATTATTGAAG 3840 

ATACATATTT GAATGTATTT AGAAAAATAA 3900 

AAAAGTGCCA CCTG 3944 
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MICROORGANISMS 

Optional Sheet in connection with the microorganisms referred to on page 25, lines 14-27 of the description ' 
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Further deposits are identified on an additional sheet Kl 4 
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Agricultural Research Culture Collection (NRRL) 
International Depositary Authority 



Address of depositary institution (including postal code and country) 4 

1815 N. University Street 

Peoria/ IL 61604 

US 



Date of deposit * March 26, 1991 Accession Number * NRRL B-1 8790 

B. ADDITIONAL INDICATIONS ' (kave blank if no* applicable). This information U continued on a separate attached sheet 



C. DESIGNATED STATES FOR WHICH INDICATIONS ARE MADE ' 



D. SEPARATE FURNISHING OF INDICATIONS B (leave blank if not applicable) 



The indications listed below will be submitted to the International Bureau later * (Specify the general nature of the indications e.g., 
"Accession Number of Deposit") 



E. £fThis sheet was received with the International application when filed (to be checked by the receiving Office) 

(Authored Officer) (j 
□ The date of receipt (from the applicant) by the International Bureau " 



(Authorized Officer) 
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Date of Deposit 



40825 



June 5, 1990 
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Date of Deposit 
August 3, 1990 
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WHAT TR CLAIMED IS ; 

1. A DNA plasmid vector capable of 
replication in prokaryotic and eukaryotic cells 

5 comprising: 

a) a eukaryotic transcription unit of 

(i) the early immediate HCMV promoter region, 
and 

(ii) a generic polylinker sequence, and 

(iii) an SV40 splice/polyadenylation site; 

b) an SV40 origin of replication; 

c) the gene encoding resistance to an 
antibiotic, under control of a prokaryotic 
promoter; and 

d) a pBR322 origin of replication. 
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2. The DNA vector of claim 1 that is pCMX 
as deposited with the NRRL, having accession number 



B-1 8790 



3. The DNA vector of claim 1 further 
comprising a structural gene inserted in the 5' - 3' 
orientation within the polylinker sequence. 

4. The DNA vector of claim 2 further 
comprising a structural gene inserted in the S« - 3" 
orientation within the polylinker sequence. 

5. The DNA vector of claim 1 further 

30 comprising a structural gene inserted in the 3- - 5' 
orientation within the polylinker sequence. 

6. The DNA vector of claim 2 further 
comprising a structural gene inserted in the 3' - 5\ 

35 orientation within the polylinker sequence. 
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7. The DNA vector of claim 1 further 
comprising a structural gene which encodes NGF-myc 
inserted in the 5 • -* 3 1 orientation within the 
polylinker sequence . 

8. The DNA vector of claim 2 further 
comprising a structural gene which encodes NGF-myc 
inserted in the 5 1 -+ 3' orientation within the 
polylinker sequence . 

9. The DNA vector of claim 1 in which the 
gene encoding resistance to an antibiotic is a gene 
encoding /3-lactamase . 

10. The DNA vector of claim 1 in which the 
T7 phage RNA polymerase promoter is fused downstream 
from the early immediate HCMV promoter region. 

11. The DNA vector of claim 2 in which the 
20 T7 P ha< 3 G ™ A polymerase promoter is fused downstream 

from the early immediate HCMV promoter region. 

12. The DNA vector pCMX : NGF-myc . 

25 13. A DNA vector comprising pCMX : NGF-myc . 

14. A DNA vector comprising pCMX. 

15. A method of cloning a structural gene of 
30 interest comprising: 

(a) inserting DNA containing the structural gene 
into the plasmid vector of claim 1; 

(b) introducing the plasmid vector resulting from 
step (a) into a mammalian host cell which 

35 expresses the large T antigen of SV40; 
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(c) subjecting the host cell to conditions such 
that the structural gene is expressed by the 
host cell to produce a protein; 

"(d)- identifying the host cell by selection on the 
5 basis of one or more properties of the 

protein; and 

(e) isolating the host cell. 

16. A method of cloning a structural gene of 
interest comprising: 

(a) inserting DNA containing the structural gene 
into the plasmid vector of claim 2; 

(b) introducing the plasmid vector resulting from 
step (a) into a mammalian host cell which 
expresses the large T antigen of SV4 0; 

(c) subjecting the host cell to conditions such 
that the structural gene is expressed by the 
host cell to produce a protein; 

(d) identifying the host cell by selection on the 
basis of one or more properties of the 
protein; and 

(e) isolating the host cell. 

17. The method according to claim 15 in 
25 which the selection is carried out by a method 

comprising immunospecific binding of an antibody to the 
protein. 

18. The method according to claim 16 in 
30 which the selection is carried out by a method 

comprising immunospecific binding of an antibody to the 
protein. 
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19. The method according to claim 17 in 
which the selection is carried out by a panning 
procedure . 

5 20. The method according to claim 18 in 

which the selection is carried out by a panning 
procedure. 

21. The method according to claim 15 in 
1Q which the host cell is a COS cell. 

22. The method according to claim 16 in 
which the host cell is a COS cell. 

^ 5 23. A method of producing a protein 

comprising growing a mammalian cell, which cell (a) 
expresses the large T antigen of SV40, and (b) contains 
the DNA vector of claim 3, in which the structural gene 
encodes the protein, and such that the protein is 

2Q expressed by the cell. 

24. A method of producing a protein 
comprising growing a mammalian cell, which cell (a) 
expresses the large T antigen of SV40, and (b) contains 

25 the DNA vector of claim 4 , in which the structural gene 
encodes the protein, and such that the protein is 
expressed by the cell. 

25. A method of producing a protein 

30 comprising growing a mammalian cell, which cell (a) 

expresses the large T antigen of SV40, and (b) contains 
the DNA vector of claim 5, in which the structural gene 
encodes the protein, and such that the protein is 
expressed by the cell. 
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26. A method of producing a protein 
comprising growing a mammalian cell, which cell (a) 
expresses the large T antigen of SV40, and (b) contains 
the DNA vector of claim 6, in which the structural gene 
encodes the protein, and such that the protein is 
expressed by the cell. 

27. The method according to claim 23 in . 
which the cell is a COS cell. 

28. The method according to claim 24 in 
which the cell is a COS cell. 

29. The method according to claim 25 in 
which the cell is a COS cell. 

30. The method according to claim 26 in 
which the cell is a COS cell. 
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ACGCGTTBAC ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 60 

CATAGCCCAT ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC CGCCTGGCTG 120 

ACCGCCCAAC GACCCCCGCC CATTGACGTC AATAATGACG TATGTTCCCA TAGTAACGCC 180 

AATAGGGACT TTCCATTGAC GTCAATGGGT GGACTATTTA CGGTAAACTG CCCACTTGGC 240 

AGTACATCAA GTGTATCATA TGCCAAGTAC GCCCCCTATT GACGTCAATG ACGGTAAATG 300 

GCCCGCCTGG CATTATGCCC AGTACATGAC CTTATGGGAC TTTCCTACTT GGCAGTACAT 360 

CTACGTATTA GTCATCGCTA TTACCATGGT GATGCGGTTT TGGCAGTACA TCAATGGGCG 420 

TGGATAGCGG TTTGACTCAC GGGGATTTCC AAGTCTCCAC CCCATTGACG TCAATGGGAG 480 

TTTGTTTTGG CACCAAAATC AACGGGACTT TCCAAAATGT CGTAACAACT CCGCCCCATT 540 

GACGCAAATG GGCGGTAGGC GTGTACGGTG GGAGGTCTAT ATAAGCAGAG CTCTCTGGCT 600 

AACTAGAGAA CCCACTGCTT ACTGGCTTAT CGAAATTAAT ACGACTCACT ATAGGGAGAC 660 

CGGAAGCTTC TAGAGATCCC TCGACCTCGA GATCCATTGT GCTGGCGCGG ATTCTTTATC 720 

ACTGATAAGT TGGTGGACAT ATTATGTTTA TCAGTGATAA AGTGTCAAGC ATGACAAAGT 780 

TGCAGCCGAA TACAGTGATC CGTGCCGCCC TAGACCTGTT GAACGAGGTC GGCGTAGACG 840 

GTCTGACGAC ACGCAAACTG GCGGAACGGT TGGGGGTTCA GCAGCCGGCG CTTTACTGGC 900 

ACTTCAGGAA CAAGCGGGCG CTGCTCGACG CACTGGCCGA AGCCATGCTG GCGGAGAATC 960 

ATAGCACTTC GGTGCCGAGA GCCGACGACG ACTGGCGCTC ATTTCTGACT GGGAATGCCC 1020 

GCAGCTTCAG GCAGGCGCTG CTCGCCTACC GCCAGCACAA TGGATCTCGA GGGATCTTCC 1080 

FIG.1A 
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ATACCTACCA GTTCTGCGCC TGCAGGTCGC GGCCGCGACT CTAGAGGATC TTTGTGAAGG 1140 
AACCTTACTT CTGTGGTGTG ACATAATTGG ACAAACTACC TACAGAGATT TAAAGCTCTA 1200 
AGGTAAATAT AAAATTTTTA AGTGTATAAT GTGTTAAACT ACTGATTCTA ATTGTTTGTG 1260 
TATTTTAGAT TCCAACCTAT GGAACTGATG AATGGGAGCA GTGGTGGAAT GCCTTTAATG 1320 
AGGAAAACCT GTTTTGCTCA GAAGAAATGC CATCTAGTGA TGATGAGGCT ACTGCTGACT 1380 
CTCAACATTC TACTCCTCCA AAAAAGAAGA GAAAGGTAGA AGACCCCAAG GACTTTCCTT 1440 
CAGAATTGCT AAGTTTTTTG AGTCATGCTG TGTTTAGTAA TAGAACTCTT GCTTGCTTTG 1500 
CTATTTACAC CACAAAGGAA AAAGCTGCAC TGCTATACAA GAAAATTATG GAAAAATATT 1560 
CTGTAACCTT TATAAGTAGG CATAACAGTT ATAATCATAA CATACTGTTT TTTCTTACTC 1620 
CACACAGGCA TAGAGTGTCT GCTATTAATA ACTATGCTCA AAAATTGTGT ACCTTTAGCT 1680 
TTTTAATTTG TAAAGGGGTT AATAAGGAAT ATTTGATGTA TAGTGCCTTG ACTAGAGATC 1740 
ATAATCAGCC ATACCACATT TGTAGAGGTT TTACTTGCTT TAAAAAACCT CCCACACCTC 1800 
CCCCTGAACC TGAAACATAA AATGAATGCA ATTGTTGTTG TTAACTTGTT TATTGCAGCT 1860 
TATAATGGTT ACAAATAAAG CAATAGCATC ACAAATTTCA CAAATAAAGC ATTTTTTTCA 1920 
CTGCATTCTA GTTGTGGTTT GTCCAAACTC ATCAATGTAT CTTATCATGT CTGGATCCTT 1980 
ACTCCGCCCA TCCCGCCCCT AACTCCGCCC AGTTCCGCCC ATTCTCCGCC CCATGGCTGA 2040 

FIG. IB 
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CTAATTTTTT TTATTTATGC AGAGGCCGAG GCCGCCTCGG CCTCTGAGCT ATTCCAGAAG 2100 

TAGTGAGGAG GCTTTTTTGG AGGCCTAGGC TTTTGCAAAA AGGAGCTCCC AGCAAAAGGC 2160 

CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA 2220 

GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA 2280 

CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC 2340 

CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCAAT GCTCACGCTG 2400 

TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC 2460 

CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG 2520 

ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT 2580 

AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGGACAGT 2640 

ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG 2700 

ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC 2760 

GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA 2820 

GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC 2880 

CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC 2940 

TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT 3000 
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SUBSTITUTE SHEET 



WO 92/17581 PCT/US92/02492 

4/7 

TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT 3060 

ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT 3120 

ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC 3180 

CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA 3240 

TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG 3300 

TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT 3360 

GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC 3420 

AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT 3480 

AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG 3540 

GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC 3600 

TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC 3660 

GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT 3720 

TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG 3780 

AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG 3840 

CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA 3900 

ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTG 3944 

FIG.1D 
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